An innovative F0 modeling approach for emphatic affirmative speech, applied to the Greek language

نویسندگان

  • Georgios P. Giannopoulos
  • Aimilios E. Chalamandaris
چکیده

Prosody generation engine which is is responsible for the naturalness of the synthetic speech, remains one of the most important component of a Text-to-Speech synthesis system. In this paper we present an innovative algorithm for modelling the fundamental frequency F0 for the Greek language, for sentences containing emphatic segments. The main idea of our approach is the definition of a specific set of intonation word models, derived from a spoken corpus, the use of which is sufficient in modeling the pitch contour of arbitrary long sentences similarly structured. Our method is based on a prosodic unit selection approach. This is tested to ILSP’s TtS system for the Greek language Ekfonitis+ [1], which is customized to utter weather reports with virtually natural synthetic voice. The system was designed and trained on a spoken corpus of 120 naturally uttered sentences of weather forecasts, containing emphasis segments and has proved to be very efficient in coping with similarly structured sentences. In the first section of the paper we present a brief review of the existing literature on this field, in addition with analogous approaches for other languages. In the second section we present our method and the design procedure. The last two sections contain the preliminary results acquired from our experiments as well as conclusions and refer to future work that needs to be carried out.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prosodic adaptations to pitch perturbation in running speech.

PURPOSE A feedback perturbation paradigm was used to investigate whether prosodic cues are controlled independently or in an integrated fashion during sentence production. METHOD Twenty-one healthy speakers of American English were asked to produce sentences with emphatic stress while receiving real-time auditory feedback of their productions. The fundamental frequency (F0) of the stressed wo...

متن کامل

Tone, intonation, and emphatic stress in L2 Mandarin speech by English and Cantonese learners

On the basis of a set of well-controlled sentences varying in sentence type, tone identity, and focus position, the present study compared F0 characteristics of Mandarin speech between native speakers and two groups of L2 learners whose native languages were Cantonese and English, respectively. The results showed systematically that most L2 errors in the F0 manifestations of tone, intonation, a...

متن کامل

Structural Modeling of Fundamental Frequency Contour for Thai Expressive Speech

Problem statement: Appropriate modeling of fundamental Frequency (F0) contour for speech is a key factor to preserve the quality of speech prosody. One successful approach has been conducted for tonal language of Mandarin Chinese. It is based on the assumption that the behavioral characteristics of vocal-fold elongation in vibration could be approximated by those of a simple forced vibrating sy...

متن کامل

English for Specific Purposes: Proposing an Innovative Approach to Teaching English to Medical Students

Background: English for Specific Purposes (ESP) courses should empower students to satisfy their language needs. However, at many Iranian universities, these courses are mostly taught based on the Grammar Translation Method focusing on teaching grammatical rules and reading comprehension. The present study aims at proposing an innovative approach for teaching English to Medical students that ca...

متن کامل

Modeling of Fundamental Frequency Contour of Thai Expressive Speech using Fujisaki’s Model and Structural Model

Problem statement: In spontaneous speech communication, prosody is an important factor that must be taken into account, since the prosody effects on not only the naturalness but also the intelligibility of speech. Focusing on synthesis of Thai expressive speech, a number of systems has been developed for years. However, the expressive speech with various speaking styles has not been accomplishe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006